RAW SEQUENCE LISTING DATE: 06/14/2001
PATENT APPLICATION: US/09/871,388 TIME: 10:25:52

Input Set : N:\Crf3\RULE60\09871388.txt 
Output set: N:\CRF3\06142001\l871388.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

4 (i) APPLICANT: Rubin, Gerald M. 

5 Pan, Duojia 
6 Rooke, Jenny

7 Yavari, Reza

9 (ii) TITLE OF INVENTION: KUZ: A Novel Family of Metalloproteases

10 (iii) NUMBER OF SEQUENCES: 10 

11 (iv) CORRESPONDENCE ADDRESS: 

12 (A) ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP

13 (B) STREET: 268 BUSH STREET, SUITE 3200

14 (C) CITY: SAN FRANCISCO 

15 (D) STATE: CALIFORNIA 

16 (E) COUNTRY: USA 

17 (F) ZIP: 94104 

18 (v) COMPUTER READABLE FORM:

19 (A) MEDIUM TYPE: Floppy disk 

20 (B) COMPUTER: IBM PC compatible

21 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

22 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

24 (vi) CURRENT APPLICATION DATA: 

25 (A) APPLICATION NUMBER: US/09/871,388

26 (B) FILING DATE: 31-May-2001 

27 (C) CLASSIFICATION: 
29 (vii) PRIOR APPLICATION DATA: 
30 (A) APPLICATION NUMBER: 08/937,931
31 (B) FILING DATE: 

33 (viii) ATTORNEY/AGENT INFORMATION:

34 (A) NAME: OSMAN, RICHARD A 

35 (B) REGISTRATION NUMBER: 36,627 
36 (C) REFERENCE/DOCKET NUMBER: B97-081

37 (ix) TELECOMMUNICATION INFORMATION: 

38 (A) TELEPHONE: (415) 343-4341 

39 (B) TELEFAX: (415) 343-4342 

41 (2) INFORMATION FOR SEQ ID NO: 1: 

42 (i) SEQUENCE CHARACTERISTICS: 

43 (A) LENGTH: 5630 base pairs 

44 (B) TYPE: nucleic acid 

45 (C) STRANDEDNESS: double 
46 (D) TOPOLOGY: linear
47 (ii) MOLECULE TYPE: cDNA 


rnaTTrTrrC TAAACAAAAA ATTCGACCGG AGTTCAAAAA TAAACAATCC AGTGAATAAA 300 
53 GAATTGTGGC TAAACAAAAH ^^'^ tCTTTTTTAT TTTCGCTTTT AATTTATTAA 360 


68 TATTCGAGGG TAAAATTATA ACGGAACGTG ATGCCTATTA 'J^yJ^ aCATCCTCAG 1260


96 TCAACGAGGA GGAGTGCAAG GACAAGTGCT GCTACCCGC^ tGCTCACCAT 2940 




102 CCTCGACCAC ACTGCCGCAC GTGAGCAAGC GCAAGTTGTG CGACTTGGCC TGCCAGGATG  3240
103 GCAATGACAC CTCCACCTGC CGCAGCACCA GCGAGTTTGC CGATAAATAT AATATTCAAA  3300
104 AGGGTGGTAT TAGTCTGCAG CCCGGTTCGC CATGCGATAA TTTCCAGGGC TACTGCGATG  3360
105 TGTTCCTTAA GTGTCGAGCC GTGGATGCCG ATGGTCCGCT TCTTCGGCTG AAGAATTTGT  3420
106 TGCTCAACCG GAAGACCCTG CAAACGGTGG CCGAGTGGAT CGTCGACAAT TGGTACCTAG  3480
107 TGGTTCTGAT GGGAGTGGCC TTTATTGTGG TCATGGGTTC GTTCATCAAA TGTTGTGCCG  3540
108 TGCACACGCC CAGTTCCAAT CCGAAGAAGC GACGAGCTCG TCGAATCAGC GAAACTCTAA  3600
109 GAGCACCCAT GAACACGTTG CGTAGAATGC AACGTCATCC CAATCAGCGA GGAGCAGGTC  3660
110 CTCGAAGCAT CCCACCGCCG GCACATGAGG CGCAGCATTA TTCACGCGGC GGAGATGGTC  3720
111 GCGGCGGCGG CGGTGGAGGC GGAGGTCGCC ACGGTGGCTC TAGGTCACAC CATCAACAGC  3780
112 ATCCGCACGA TTGGGATCGT CATCAGGGTG GCCACTCAAT CGTCCCATTG CCCACCGGCG  3840
113 GCAGCCATTC AAGTCGCAAC TCGGCGGCGA ATCAAGCGAG AAGAAGCGAT GGACGAGGTC  3900
114 CACGATCCAC CAGCAGTGGG CGGCCGCAGG CTATAGCCAG CGGAAGCGGT GCCGCGAGCG  3960
115 GAGCAGCGCG ATCTCATGGC GGGTACGGAG CCGAACAGGC GATACCGGGT TCCATTGGTG  4020
116 GTGGTGTCCA GGCGGCCATT AGCAGCGGCG GTGTGGTGGC TCGGGCCCAG CTGCCGCTGC  4080
117 CATTGCCGCC GCCAAATGGA CAGCAGCAAA TGCAACAGCA ACAACAACTG CAACTACAGC  4140
118 AACCGGCAAT TTCGCCGCAG CAGCAGCCGC AGCAAGCGTT CTACACGCCG AAAGAACTAC  4200
119 CACCACGCAA TAAGTCCCGA TCATCACGTA CCAACAACAC CTCCAACACC ACAACCACCA  4260
120 CCAACTCATC CACAGCGGCA GCCGGCAGTG GGTCGGTCTC GGGACCGGGC TCGGGGGCGG  4320
121 GCAGTAGTAG TAAGAGCAAG AGCGGTAAAA GTGCCAAAGC CAAAGACTCA AAGTCGCAAA  4380
122 AATCGCAGCA GGCCAACAAC AGTCGCAGCA GCAGCAAGGA GAAGGGCGTC AAGCCAGTGC  4440
123 GCCGAAATAT CGTTTATTAG GAGCGGAACC ATCACATTGC CATACACAAC ACTGAACGAA  4500
124 ATATAGCCCC GAACCCAAAA TATCAAATGC AACCACATAT AGAATCGCCC GCTGCTAGTC  4560
125 ATCGAACTAC ATGTATGAGT TGTTGCTTCC CATCCACCGA CAAACACAAA CAGAAAAGAA  4620
126 ATTATAATGA TATTTCATTT AATCGATGCA ATTGGCGTCG CGCCGCCTCC GCTACAAGTA  4680
127 AGCTTTAGTC GGCCGACATC GTTGCACGAG CAACAGCAGC AGCAACATCA TCTGCAGCAG  4740
128 CAGCAGCAGC ATCAGCAGCA ACTGGAGCCG CAGCAGCAAC ACGCCTATGC CGATGCTTAT  4800
129 GCGGCCTTGG GGCGGGGCCA GTATGAGTCC ACCACGCGGG CGCCCAACAA CAGCAAGGTT  4860
130 TGACAGCCAA AAGTAGCAAT GGAGCGCCAC AAAAGGCCAA AGGCTAAGCG ACTCAAGCAG  4920
131 CAGAAGGAGC CGCATACACA GCAAACAACA ACACAGCAAC AAAAGCAAAA ACAACATAAA  4980
132 TCAAATGAAC TCAAATTAAA TGTAAATGTA ATTTTTATGC TAATTATTTT TATTTAAACA  5040
133 GTGTTTGTAT GCCACAAGGG AAAACAGCCA GCAACAAAAA GAAAAATACA AAAATAACAC  5100
134 AAAAAAGGAG ACAAATTTCG TAATACAGAA AAAGCTGAAA GTGAATGATA TTTTTGATTA  5160
135 ACTAAATTAA AATGAAAATA CGAATGCAAA TTATGAATAA TAAAAGTAAT TAAAAACGAC  5220
136 AACATGCATA ATACATATAA AGTTGCAAGT TGCATATATA TACATTTGTA TGTATATATT  5280
137 TATTATGGAT ACACAATTAT TAAATAGCAG CAGCCACAAC AAACAAGTAA TATACATGAA  5340
138 GAAAAACTAA GGTTTAATTG TATGAGAAAG CATTCTATAT GTCGGTGAGA TTTCTAAGCG  5400
139 CTAGGCCGAA ATACAAAATT AATTACACAC TTGAATAACA AAATGTGTTT TGTACAAAAA  5460
140 AAAAAAAATG AAATAAACAA AAACAGTGCG AATTAATTAA GCGTCATTAT AAAAAAAAGA  5520
141 ACGGAAACAA CAAAGCATTT AAATTGTATT TATCTGTACC GAAGCTAAAC GTTTATTTAA  5580
142 AGCCGTCAAA ATTGCATTTG TAAACTAGCA AAACAAAAAA AAAAAAAAAC  5630

144 (2) INFORMATION FOR SEQ ID NO: 2:

145 (i) SEQUENCE CHARACTERISTICS:

146 (A) LENGTH: 1239 amino acids

147 (B) TYPE: amino acid

148 (C) STRANDEDNESS: single

149 (D) TOPOLOGY: linear

150 (ii) MOLECULE TYPE: peptide

151 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

152 Met Ser Ser Lys Cys Ala Phe Asn Ile Val Phe Val Ser Ile Ile Phe
153 1               5                   10                  15
154 Ile Ile Ile Val Asn Gly Tyr Ala Lys Asp Ile Ser Gly Val Lys Arg
155             20                  25                  30
156 Gly His Glu Arg Leu Asn Glu Tyr Ile Ser His Tyr Glu Thr Leu Asn
157         35                  40                  45
158 Tyr Asp His Glu His Ile Arg Ala Ser His Asn Arg Ala Arg Arg Ser
159     50                  55                  60
160 Val Thr Lys Asp Gln Tyr Val His Leu Lys Phe Ala Ser His Gly Arg
161 65                  70                  75                  80
162 Asp Phe His Leu Arg Leu Lys Arg Asp Leu Asn Thr Phe Ser Asn Lys
163                 85                  90                  95
164 Leu Asp Phe Tyr Asp Ser Lys Gly Pro Ile Asp Val Ser Thr Asp His
165             100                 105                 110
166 Ile Tyr Glu Gly Glu Val Ile Gly Asp Arg Asn Ser Tyr Val Phe Gly
167         115                 120                 125
168 Ser Ile His Asn Gly Val Phe Glu Gly Lys Ile Ile Thr Glu Arg Asp
169     130                 135                 140
170 Ala Tyr Tyr Val Glu His Ala Lys His Tyr Phe Pro Thr Asn Arg Thr
171 145                 150                 155                 160
172 Ala Thr Thr Thr Pro Pro Ser Thr Ser Thr Thr Ser Ser Ala Thr Thr
173                 165                 170                 175
174 Val Thr Lys Ser Thr Gln Pro Thr Arg Pro Leu Ala Lys Ser Asn Thr
175             180                 185                 190
176 Ser Thr Thr Ala Val Asn Ser Lys Thr Glu Asn Phe Ile Lys Lys Ile
177         195                 200                 205
178 Ala Glu Ser Thr Thr Thr Ser Gln Gln Leu Pro Glu Tyr Thr Glu Ser
179     210                 215                 220
180 Ser Ser Ser Ser Ser Thr Thr Thr Phe Pro Pro Thr Thr Glu Tyr Phe
181 225                 230                 235                 240
182 Glu Asp Glu Lys Glu Arg Asn Ala Glu Asp Glu Leu Asp Phe His Ser
183                 245                 250                 255
184 Ile Ile Tyr Lys Glu Ser His Val Glu Asp Ala Tyr Glu Asn Val Arg
185             260                 265                 270
186 Glu Gly His Val Ala Gly Cys Gly Ile Thr Asp Glu Val Ser Gln Trp
187         275                 280                 285
188 Met Glu Asn Ile Gln Asn Ser Ala Val Glu Glu Leu Pro Glu Pro Met
189     290                 295                 300
190 Ser Lys Asp Tyr Gln Lys Leu His Arg Lys Gln Leu His Lys Lys Ser
191 305                 310                 315                 320
192 Ala Pro Gln Gln Gln Gln Gln Pro His Pro Pro Lys Lys Tyr Ile Ser
193                 325                 330                 335
194 Gly Asp Glu Asp Phe Lys Tyr Pro His Gln Lys Tyr Thr Lys Glu Ala
195             340                 345                 350
196 Asn Phe Ala Glu Gly Ala Phe Tyr Asp Pro Ser Thr Gly Arg Arg Leu
197         355                 360                 365
198 Gly Ser Ser Ala Asn Val Ala Asp Trp His Gln Leu Val His Glu Arg
199     370                 375                 380
200 Val Arg Arg Ala Thr Asp Asn Gly Ala Gly Asp Arg Gly Ser Ser Gly
201 385                 390                 395                 400
202 Gly Ser Gly Arg Gly Arg Glu Asp Asn Lys Asn Thr Cys Ser Leu Tyr
203                 405                 410                 415
204 Ile Gln Thr Asp Pro Leu Ile Trp Arg His Ile Arg Glu Gly Ile Ala
205             420                 425                 430
206 Asp His Asp Arg Gly Arg Lys Tyr Glu Val Asp Glu Lys Thr Arg Glu
207         435                 440                 445
208 Glu Ile Thr Ser Leu Ile Ala His His Val Thr Ala Val Asn Tyr Ile
209     450                 455                 460
210 Tyr Arg Asn Thr Lys Phe Asp Gly Arg Thr Glu His Arg Asn Ile Arg
211 465                 470                 475                 480
212 Phe Glu Val Gln Arg Ile Lys Ile Asp Asp Asp Ser Ala Cys Arg Asn
213                 485                 490                 495
214 Ser Tyr Asn Gly Pro His Asn Ala Phe Cys Asn Glu His Met Asp Val
215             500                 505                 510
216 Ser Asn Phe Leu Asn Leu His Ser Leu Glu Asp His Ser Asp Phe Cys
217         515                 520                 525
218 Leu Ala Tyr Val Phe Thr Tyr Arg Asp Phe Thr Gly Gly Thr Leu Gly
219     530                 535                 540
220 Leu Ala Trp Val Ala Ser Ala Ser Gly Ala Ser Gly Gly Ile Cys Glu
221 545                 550                 555                 560
222 Lys Tyr Lys Thr Tyr Thr Glu Thr Val Gly Gly Gln Tyr Gln Ser Thr
223                 565                 570                 575
224 Lys Arg Ser Leu Asn Thr Gly Ile Ile Thr Phe Val Asn Tyr Asn Ser
225             580                 585                 590
226 Arg Val Pro Pro Lys Val Ser Gln Leu Thr Leu Ala His Glu Ile Gly
227         595                 600                 605
228 His Asn Phe Gly Ser Pro His Asp Tyr Pro Gln Glu Cys Arg Pro Gly
229     610                 615                 620
230 Gly Leu Asn Gly Asn Tyr Ile Met Phe Ala Ser Ala Thr Ser Gly Asp
231 625                 630                 635                 640
232 Arg Pro Asn Asn Ser Lys Phe Ser Pro Cys Ser Ile Arg Asn Ile Ser
233                 645                 650                 655
234 Asn Val Leu Asp Val Leu Val Gly Asn Thr Lys Arg Asp Cys Phe Lys
235             660                 665                 670
236 Ala Ser Glu Gly Ala Phe Cys Gly Asn Lys Ile Val Glu Ser Gly Glu
237         675                 680                 685
238 Glu Cys Asp Cys Gly Phe Asn Glu Glu Glu Cys Lys Asp Lys Cys Cys
239     690                 695                 700
240 Tyr Pro Arg Leu Ile Ser Glu Tyr Asp Gln Ser Leu Asn Ser Ser Ala
241 705                 710                 715                 720
242 l°yl Gly Cys Thr Arg Arg Ala Lys Thr Gln Cys Ser Pro Ser Gln Gly
243                 725                 730                 735
244 Pro Cys Cys Leu Ser Asn Ser Cys Thr Phe Val Pro Thr Ser Tyr His
245             740                 745                 750
246 Gln Lys Cys Lys Glu Glu Thr Glu Cys Ser Trp Ser Ser Thr Cys Asn
247         755                 760                 765
248 Gly Thr Thr Ala Glu Cys Pro Glu Pro Arg His Arg Asp Asp Lys Thr
249     770                 775                 780
VERIFICATION SUMMARY DATE: 06/14/2001

PATENT APPLICATION: US/09/871,388 TIME: 10:25:53 

Input Set : N:\Crf3\RULE60\09871388.txt 
Output Set: N:\CRF3\06142001\l871388.raw 

L:25 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
L:26 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]